Material to A note on oligonucleotide expression values not being normally distributed
نویسندگان
چکیده
Motivation: Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance science. Because the underlying structure, or distribution, of microarray data is unknown, novel methods are typically tested against the assumed structure of normally distributed data. However, microarray data are not, in fact, normally distributed, and testing against such data can have misleading consequences. Results: Using an Affymetrix technical replicate Spike-In data set, we showed that oligonucleotide expression values are not universally normally distributed under any of the standard methods for extracting expression values. The resulting data tend to have a large proportion of skew and heavy tailed values. Using data simulated under three models (normal, heavy tailed, and skewed), additionally, we showed that standard methodologies (for differential expression and gene similarity) can give unexpected and misleading results when the data are not normally distributed. Robust methods should be used when analyzing microarray data. Additionally, when evaluating new techniques, skewed and/or heavy tailed data distributions should be considered in simulations.
منابع مشابه
A note on oligonucleotide expression values not being normally distributed.
Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance science. Because the underlying distribution of microarray data is unknown, novel methods are typically tested against the assumed normal distribution. However, microarr...
متن کاملOligonucleotide microarray data are not normally distributed
ABSTRACT Motivation: Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance the science. Because the underlying structure, or distribution, of microarray data is unknown, novel methods are usually tested against the known st...
متن کاملEffect of Alccofine on Mechanical and Durability Index Properties of Green Concrete (TECHNICAL NOTE)
In the modern era, many research works are being carried out throughout the world for finding out a suitable cementitious material for the replacement of cement. The supplementary cementitious materials (SCM) can be used as a replacement of cement in the construction industry to minimize the carbon dioxide emission which is implicated in global warming and climatic changes in the environment. T...
متن کاملComparison of background correction and normalization procedures for high-density oligonucleotide microarrays
Oligonucleotide microarrays are now becoming a widely used research tool in gene expression analysis. A large variety of preprocessing methods for raw intensity measures is available to establish per-gene expression values. For their evaluation, a small number of spike-in and dilution data sets has been published. However, calibration data sets with varying parameters such as percentage of diff...
متن کاملClassification of oligonucleotide fingerprints: application for microbial community and gene expression analyses
MOTIVATION Oligonucleotide fingerprinting of ribosomal RNA genes (OFRG) is a procedure that sorts rRNA gene (rDNA) clones into taxonomic groups through a series of hybridization experiments. The hybridization signals are classified into three discrete values 0, 1 and N, where 0 and 1, respectively, specify negative and positive hybridization events and N designates an uncertain assignment. This...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009